Corpus: swa_wikipedia_2014

Other corpora

3.6.3 Zipf's law for words with same first letter

Zipf's law restricted to words with first letter a, b, c, and d


Zipf's diagram for words of fixed length


Gnuplot diagram

Top words a-
rank frequency word
1 6896 au
2 2763 ambayo
3 2559 alikuwa
4 1874 aina
5 1781 ambao
Top words b-
rank frequency word
1 2742 baada
2 1238 bila
3 1137 baadaye
4 947 bora
5 929 biashara
Top words c-
rank frequency word
1 10515 cha
2 2409 chini
3 946 chake
4 763 chakula
5 562 chama
Top words d-
rank frequency word
1 1295 dhidi
2 1270 damu
3 1210 duniani
4 1107 dunia
5 890 dawa
97 msec needed at 2018-01-22 13:07